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INTRODUCTION 


In  developing  clutter  models,  Raytheon  used  smoothing  cubic  splines 
to  obtain  analytical  expression  for  the  clutter  amplitude  distribution 
functions . ^ Volume  III  of  Reference  1 discusses  Raytheon's  application 
of  cubic  splines . ^This  application  is  believed  to  be  unique.  This 
report  has  been  prepared  to  describe  in  detail  the  development  of  the 
smoothing  cubic  spline.  An  explanation  of  the  large  oscillations  in 
the  error  obtained  in  the  fit  or 'data  match  for  large  clutter  values  is 
given,  and  a method  that  would  avoid  the  oscillations  is  discussed. 

DISCUSSION 

Smoothing  cubic  splines  are  useful  tools  to  reconstruct  a 
curve  from  discrete  noisy  data  obtained  from  observations  of  a physical 
process  in  which  one  is  interested  only  in  a curve  fit  whose  ordinate, 


slope,  and  curvature  are  continuous  over  the  entire  range  of  abscissa 
values.  A detailed  exposition  of  the  smoothing  cubic  spline  developed 
in  Section  1.1.2  of  Appendix  B in  Volume  III  is  given  below  to  clarify 
the  procedures  used  by  Raytheon.  First  define  H(x)  * Log^  1 1 -F (x) J , 
where  F(x)  is  the  cumulative  distribution  of  the  observed  random  variable 
x.  The  sequence  |x^j  , i*l,2,...,  N of  observations  is  given,  and  the 
value  of  the  histogram  at  the  observed  x * x^  is  denoted  by  H(x^) . If 
t = log^g  (x/10),  a cubic  spline  S(t)  can  be  determined  in  the  following 
manner  as  a continuous  smoothed  fit  to  H(x) : Denoting  by  t,  , t , t , 

kl  V k3 

and  t^  four  as  yet  undetermined  t values  out  of  the  sequence  Jt^j  , 

i = 1,2,...,N  an  S(t)  defined  for  t.  s tst  is  to  be  found  that  has 

kl  k4 

the  following  properties:  In  each  of  the  three  t intervals  S(t)  is  a 

cubic  polynomial  in  t,  so  that  in  general  three  separate  cubics  can  be 
constructed.  These  cubics  are  constrained  so  that  S(t-  ) * S(tH-  ); 

S’(t-  ) - S’(t£  ),  and  S"(t-  ) - S"(tk  +)  for  j 3 1 and  2.  In  effect 
then  llhe  desired  function  S(^)  will  be^continuous  everywhere  and  possess 
continuous  first  and  second  derivatives  in  the  interval  t < t « t,  . 
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Moreover,  subject  to  these  constraints,  S(t)  is  to  be  chosen  to  minimize 
the  quantity  E,  where 


E - 2 W [ H(x. ) - S(t  ) 1 

i-1  ‘ 


and  in  which 


10  lo®lQ  (xi/l0) 


i = 1,2,. ...N 


W1  *(t2_t^/(tN  " tl) 

wi  ='(ti+l  ' ti)/(tN  “ tl)  * * 2,3, ...  ,N 

WN  “(CN  “ ' tl) 

The  four  members  of  the  sequence  jckj|  are  ter“*®d  the  knots  of  the  spline, 

i.e.,  the  points  at  which  the  various  continuity  properties  must  hold. 

If  S(t)  is  to  encompass  the  entire  range  of  values  of  t,  t = t. , and 

*1  L 

t.  = t„.  Before  discussing  the  manner  in  which  t.  and  t.  are  chosen 
k4  - N B k2  k3 

so  as  to  minimize  E,  the  way  the  three  cubics  are  obtained  for  fixed 

knots  so  as  to  minimize  E will  be  shown.  There  are  several  ways  to 

carry  out  the  analysis;  the  method  which  leads  to  the  simplest  calculations 

will  be  shown. 

3 

First  the  truncated  power  function  t is  defined  as 


Every  cubic  spline  S(t)  defined  over  t.  <tst, 

K1  k4 


with  interior  knots 


t^  and  tj^  has  a unique  representation 


p3(t)  + c1(t-tk2>^  + c2(t-tk^)^  , tk^<t<tk^ 


where  p^ (t > is  a cubic  polynomial. 
Hence, 


sl(t)  - a^t-t^)  + a2(t-tki)  + a3(t-tki)  ♦ a4. 


< tstL 


n 


S(t) 


sl(t)  + cl(t'tk2) 


t,  £ tfit,  . 


k3 


S-l (t)  + cx(t-tk  ) + c0(t-t,.  )' 


tkstStk  • 

k3  k4 
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Hence  the  criterion  function  that  is  to  be  minimized  has  the 


form 


2 - *3<VV  ‘a4'2 


■jf/j  [H<xj)  " al<tj‘tk1>3  •*2(Vtk1> 

k3 

+ * wj  [H(V  • ci<vV  - *i(tj-tk1>  - wv 


-a3(t. 
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+ I_k  »j  (Hfej)  - CjCtj^)3  - c^t.-t^)3  - a^-t^y 


The  determination  of  c^,  c2>  a^,  a2>  a^,  a^  is  made  easiest  by 
solving  for  c^  as  a linear  combination  of  a1 , a.,  a,,  a,  in  the  equation 

&£  ^ J H 

9^  * 0 as  one  of  the  six  minimizing  equations.  Specifically 


J-k2  [H(xJ)-a1(tJ-tki)3-«2(trtki)-a3(trtki)-«4| 

°i  ' k3  ; 
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The  equation  ■ 0 can  be  used  to  express  c2  as  another  linear 

combination  of  a^  a*,  a.j,  a Then  the  equations  ^ ^ aE  8£ 

9a.  8a_  9a  8a 
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yield  a linear  set  of  equations  requiring  only  the  inversion  of  a 
4x4  matrix,  or  the  equations  can  be  solved  directly  by  inversion  of  a 
6x6  matrix.  Finally,  the  location  of  the  variable  knots  t^  and  tj^ 
is  determined  by  those  values  which  minimize  E.  A variable  knot  computer 
program  was  used  by  Raytheon  to  determine  the  placement  of  t^  and  t^ 
which  minimize  E.  The  computer  program  is  a standard  routine  in  the 
International  Mathematical  and  Statistical  Libraries  (IMSL)  called  ICS 
VKU. 


An  improved  fit  would  have  been  obtained  by  fitting  a smoothing 
cubic  spline  directly  to  the  histogram  F(x) , i = 1,2,...,  N and  imposing 
the  boundary  conditions  on  the  spline  fit  S(x)  that  S(xi>  = S'(x^)  = 0 
and  S (xfl)  = 1 , S' (x^)  = 0 . 

With  the  relation  t * 10  log^g  (^-)  large  errors  in  the  fits 
obtained  to  the  function  G(t)  = log^g  |l“F(x)j  are  to  be  expected  for 
large  values  of  t.  The  discussion  below  gives  the  reason  for  the  large 
oscillations  in  the  error  obtained  in  every  fit  for  large  values  of  t. 
From  the  function  H(x)  = log^Q  |l"F(x)J,  the  relation 


H'(x) 


. 1O810  e 


f (x) 


1 - F (x) 

can  be  obtained  where  f(x)  denotes  the  probability  density  function  of  x. 
By  a double  application  of  L'Hospital's  rule,  the  conclusion  that  Lim 

v X—  OO 

- H'(x), 


H' (x)  = -«can  be  reached.  Since  G'(t) 


10  log10  e 


lim  G ' (t)  = -oo.  Moreover  the  lim  H(x)  = lim  G(t)  = - oo  . 

X— 00  X—OO  X— OO 

Since  a fit  G(t)  is  to  be  obtained  by  a smoothing  cubic  spline 
which  possesses  finite  ordinates  and  slopes  everywhere,  it  is  to  be 
expected  that  large  oscillations  would  occur  in  the  fit  for  large  values 
of  t.  This  is  shown  in  the  error  graphs  on  pages  B-27,  B-31,  B-35,  etc. 
of  Volume  III. 
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